Overview
What is Pentaho?
Pentaho is a suite of open source business intelligence and analytics products, now offered and supported by Hitachi Data Systems since the June 2015 acquisition.
Trust Pentaho to deliver a smooth data mining experience
Best for Reporting and Dashboard
Data integration made easy
Using the Pentaho tools to solve ETL challenges
Integrate and connect all data sources with Ai engineering in the cloud environment
My Experience with Pentaho data integrator
Robust platform that needs some refresher to compete with more modern tools
A great tool, but falls under the shadow of the larger players.
Pentaho for your business
Pentaho can be a great solution for a company
Pentaho was just OK as an OEM solution
Pentaho - Yay or Nay?
Pentaho: A great DW/BI solution
Rookie Pentaho review from an ETL veteran
Pentaho - Enhancing our ETL Capabilities
Popular Features
- Publish to PDF: 9.7, 97% (19 ratings)
- Role-Based Security Model: 9.5, 95% (19 ratings)
- Multi-User Support (named login): 9.3, 93% (20 ratings)
- Formatting capabilities: 8.3, 83% (19 ratings)
Pricing
Entry-level set up fee?
- No setup fee
Offerings
- Free Trial
- Free/Freemium Version
- Premium Consulting/Integration Services
Alternatives Pricing
What is Microsoft Power BI?
Microsoft Power BI is a visualization and data discovery tool from Microsoft. It allows users to convert data into visuals and graphics, visually explore and analyze data, collaborate on interactive dashboards and reports, and scale across their organization with built-in governance and security.
What is SAP Lumira Discovery?
SAP Lumira Discovery is SAP’s data visualization and discovery application. It facilitates data discovery, visualization, and analysis by assisting users with creation of dashboards, infographics, presentations, data facets, tag clouds, and more.
Product Demos
Pentaho and Hadoop Demo
Adempiere Pentaho Demo - Products & Sales figures
Agile BI with Pentaho BI Suite Demo
Google Analytics (part 1/2) - Pentaho Data Integration Demo
PENTAHO DATA INTEGRATION TOOL DEMO
Control Panel (Dashboard) with Pentaho, by Matrix CPM Solutions
Features
BI Standard Reporting
Standard reporting means pre-built or canned reports available to users without having to create them.
- Pixel Perfect reports: 8.7 (18 ratings)
Pixel Perfect reports are highly-formatted reports with graphics and ability to preview the report before printing.
- Customizable dashboards: 9.8 (18 ratings)
Customizable dashboards are dashboards providing the builder some degree of control over the look and feel and display options.
- Report Formatting Templates: 8.6 (18 ratings)
Ad-hoc Reporting
Ad-Hoc Reports are reports built by the user to meet highly specific requirements.
- Drill-down analysis: 7.7 (18 ratings)
Drill down analysis is the ability to get to a further level of detail by going deeper into the hierarchy.
- Formatting capabilities: 8.3 (19 ratings)
Ability to format output e.g. conditional formatting, lines, headers, footers.
- Integration with R or other statistical packages: 9.3 (12 ratings)
Integration with the open-source R predictive modeling environment.
- Report sharing and collaboration: 9.6 (17 ratings)
Report sharing and collaboration is the ability to easily share reports with others.
Report Output and Scheduling
Ability to schedule and manage report output.
- Publish to Web: 9.6 (18 ratings)
- Publish to PDF: 9.7 (19 ratings)
- Report Versioning: 9.6 (13 ratings)
Report versioning is the assignment of version numbers to each version of a report to help in tracking.
- Report Delivery Scheduling: 9.9 (17 ratings)
Report Delivery Scheduling is the ability to have reports delivered to a destination at a specific date and time.
- Delivery to Remote Servers: 9.3 (10 ratings)
Ability to deliver reports to remote servers.
Data Discovery and Visualization
Data Discovery and Visualization is the analysis of multiple data sources in a search for patterns and outliers and the ability to represent the data visually.
- Pre-built visualization formats (heatmaps, scatter plots etc.): 7.9 (16 ratings)
Pre-built visualization formats are canned visualization types that can be selected to visualize different kinds of data.
- Location Analytics / Geographic Visualization: 8.2 (16 ratings)
Location analytics is the visualization of geographical or spatial data.
- Predictive Analytics: 8.3 (14 ratings)
Predictive Analytics is the ability to build forecasting models based on existing data sets.
Access Control and Security
Access control means being able to determine who has access to which data.
- Multi-User Support (named login): 9.3 (20 ratings)
Named model access means that users have access based on name and password.
- Role-Based Security Model: 9.5 (19 ratings)
Role-based access means that access to data is determined by job or position in the corporation.
- Multiple Access Permission Levels (Create, Read, Delete): 9.9 (18 ratings)
Multiple access permission levels means that different levels of users have different rights.
- Single Sign-On (SSO): 7.7 (10 ratings)
Allows users to use one set of login credentials to access multiple applications
Mobile Capabilities
Support for mobile devices like smartphones and tablets.
- Responsive Design for Web Access: 9.7 (10 ratings)
Web design aimed at producing easy-to-read sites across a range of different devices.
- Mobile Application: 7 (7 ratings)
A dedicated app for iOS and/or Android.
- Dashboard / Report / Visualization Interactivity on Mobile: 8.7 (11 ratings)
In-app dashboard reports and data visualization.
Application Program Interfaces (APIs) / Embedding
APIs are a set of routines, protocols, and tools used for embedding one application in another.
- REST API: 8.3 (10 ratings)
REST is an architecture style for designing networked applications.
- JavaScript API: 9 (9 ratings)
A JavaScript API exposes an application's functionality to JavaScript code, for example in a web page.
- iFrames: 7.3 (9 ratings)
An iFrame is an HTML document embedded inside another HTML document on a website.
- Java API: 8.6 (9 ratings)
A Java application programming interface (API) is a set of classes and interfaces that Java code can call.
- Themeable User Interface (UI): 8.9 (10 ratings)
A themeable user interface means that a specific visual theme can be applied to it.
- Customizable Platform (Open Source): 9.6 (10 ratings)
A customizable, open source platform can be adapted by modifying its source code.
Product Details
Pentaho Technical Details
Attribute | Value |
---|---|
Operating Systems | Unspecified |
Mobile Application | No |
Reviews and Ratings (131)
Community Insights
- Business Problems Solved
- Pros
- Cons
Pentaho has proven to be a valuable tool for users across a range of industries and business functions. Users have found value in using Pentaho for building data warehouses for data migration and analytics, covering all scenarios and allowing the use of external jars for unsupported activities. Pentaho's BI stack is utilized for ETL, report delivery, and as an endpoint for custom web apps. It offers ease of learning and impressive functionality, making it a popular choice for small-size private consulting firms. Pentaho is also used by engineering departments to create data warehouse environments for multiple customers, enabling analytics usage. In addition, it serves as the primary source for Business Intelligence in many companies, used by multiple users for creating reports and evaluating work across different teams. The software's ability to handle large and complex data sets with ease has been highly regarded by users, along with its advanced features for data integration, reporting, and analytical dashboards. Overall, Pentaho meets various needs such as scheduled ETL processes, data ingestion, reporting and dashboard development, making it a flexible solution across organizations.
Wide range of tools and features: Users appreciate the flexibility of Pentaho, as it offers a wide range of tools and features that can be tailored to meet the specific needs of different users and organizations. This has been mentioned by multiple reviewers who found this feature highly customizable and easy to learn.
Excellent reporting tool: Pentaho is praised for being an excellent reporting tool, with features like data reporting, integration, data mining, and ETL. Users find it intuitive and easy to use, even for advanced users. The visual interface simplifies processes that would traditionally require writing lines of code. Many reviewers have highlighted this aspect as one of the strengths of Pentaho.
Highly accessible data integration module: Pentaho's Data Integration module is highly regarded for its maturity and ease of learning. It allows business users to quickly connect to almost any data source. The ability to preview data in a pivot view format enables early data analysis. Several users have mentioned this module as a valuable feature in their reviews.
Limited Data Visualization Capabilities: Users have expressed the need for Pentaho to improve its data visualization capabilities in order to enhance the end result of dashboards. Specifically, some users feel that the current options are limited and lack advanced features and customization.
Difficulties with Mondrian-based ROLAP: Some users have mentioned that the Mondrian based ROLAP capability does not scale up well when analyzing data for a large number of subscribers over different time periods. This has resulted in performance issues and challenges in meeting the needs of complex business requirements.
Lack of Support and Guidance for WEKA: Users believe that WEKA, the machine learning platform used by Pentaho, lacks use cases in production environments and requires more support and guidance. They have found it challenging to implement and fully leverage the potential of WEKA within their organizations due to limited resources and documentation.
Reviews (1-20 of 20)
Best for Reporting and Dashboard
- Reporting
- Integration
- Flexibility
- Cost
- DB connector
- Highly user friendly
Data integration made easy
- Data Integration with (almost) any source
- ETL jobs
- Managing and scheduling recurring processes
- Report development could be easier
- Better integration with online repository
- Native deployment and development environments
- Integrate and synchronize with big data easily
- Import data from any sources and different databases
- Managing data in on-premise, hybrid and cloud environments.
- Compatibility and flexibility of the platform with any type of scenario and any business or industry
- Various tools in the software suite for transforming data
- Simple interface appearance and creative UI graphics
- It has good modules and they should work on more variety and new modules suitable for the service
- Initial configuration is a bit time consuming and complicated for novice users; a smart configuration wizard is needed
- It is relatively stable, robust and reliable.
- Interactive analysis reports are very useful and relatively easy to use.
- Relatively simple to integrate with other systems
- The type of graphs and layouts proposed for diagrams and charts are a little bit limited and partly outdated.
- For certain reports you can feel that it is not using the latest web technologies.
- The tools for creating new report template designs could be improved.
If you want to create very nice-looking interactive graphs using the latest web technologies, there might be other tools that are better suited. We have also seen performance problems with large amounts of data, though it is not certain whether this was solely the fault of Pentaho or also of the surrounding infrastructure.
- The built-in ETL tools are easy to learn, and can quickly import and transform any data you have.
- The excellent visualizations and charts are pleasing to the eye, and looks are important in sales and marketing presentations.
- The rollout was fast, we installed the software and were building dashboards within minutes.
- I think the relative obscurity of the tool is a downside, not as many developers, consultants or peers you can tap into.
- Lack of a solid user community held us back, looking at Power BI and Qlik, they have huge user communities that help each other out. Would have liked that here.
- Smaller company means smaller sales force, and the lack of a local presence made it hard to only interact online with the account rep. Other companies have someone local who often stops by with pre-sales developers to just pitch in free of charge when they have time.
Pentaho can be a great solution for a company
- SQL query + scripting will help you design different objects, tables, pivots, dashboards, linked to other reports.
- Organizing in which way you want, synchronizing with AD is very helpful to set security/folder/group of users/single users.
- By using Eclipse you can set another layer of security by filtering the information users need to access.
- Sending reports by email can also be set up by security, by groups or list of users.
- Design seems pretty secure, from time to time can fail when there is too much design.
- Would like to have a format copy for different objects, tables, cross-tabs, grids; it would make design much easier. Of course, we use templates and CSS design for these objects, but from time to time we design something different, and it is much harder to do this manually for each similar object.
- Possibility to use multiple DB in the .xaction.
- Operational reports, big lists of COs, POs, DOs, master information and details. Bursting reports by email and scheduling tasks are interesting and very useful.
- Complex aggregations are useful to use in cross tabs and calculated functions in the tables. I would recommend a new tool if the purpose is dashboards, but they still can be created in Pentaho as well.
Pentaho - Yay or Nay?
- User friendly interface
- Easy and quick report generation
- Great report customizing features
- Heavy software to install
- Requires licensing to enable all features
- Difficult to integrate with other servers
Pentaho: A great DW/BI solution
- The first thing I noticed about Pentaho is that it is user friendly. The best thing about Pentaho is Pentaho Data Integration. I have never used a more user-friendly ETL tool than PDI. All the jobs and transformation steps are easy to understand. And I like the sample transformations and jobs that are provided with the package. It is so user friendly that, even if you don't know SQL, it will generate it for you. If you don't want to write scripts, that's fine, you can do it in PDI.
- I have found that Pentaho can be integrated with any technology or framework. I have easily and successfully integrated it with HDFS, EMR, S3, CouchDB, many different RDBMSs. I would consider it as a strength of Pentaho. Also if you are stuck or you find any error, the type of logging will have an answer for you. I found the logging mechanism very effective.
- On the BI side I have mostly used Pentaho Analyzer and Schema Workbench. They are user friendly too, and very few users come to the developers for help understanding the UI of PUC.
- Most of the companies use star schema in their Data Warehouses but they are not the pure star schemas. There are the bridge tables, group tables but when using Schema Workbench to design a cube, it gets very painful for the developers to accommodate such schema in it. To do this, I have to go to the XML file and add the new elements. I would love to see the feature where Schema Workbench can accommodate the bridge tables as they are the part of star schema too.
- When it comes to ETL, I have found PDI to be the best tool, but on the reporting side it is not as good as the other tools available in the market. Users especially complain about the graphs in Pentaho Analyzer. I think the UI needs a lot of improvement.
- PDI is slow at reading JSON files. There is a fast JSON input step available in the marketplace, but it would be great if Pentaho could make JSON reading even faster.
- When I export the repository, I see the file names are encoded with UTF-8 encoding. It would be great if the spaces and the special characters could be preserved while exporting a BA repository.
Less appropriate scenarios: When you have star schemas with bridge tables, or snowflake schemas, a lot of additional work is apparently needed in Pentaho. Also, dealing with files is not bad, but it should be improved.
** My review is for Pentaho 5.4.0.8 or previous releases.
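To make the bridge-table point concrete, here is a minimal sketch of the kind of schema the review describes, using Python's built-in sqlite3 module rather than any Pentaho tool. Every table and column name (fact_sales, bridge_account_customer, weight and so on) is hypothetical and chosen only for illustration; it does not show how Schema Workbench or Mondrian models such a schema.

```python
import sqlite3

# Hypothetical schema: all table and column names are illustrative assumptions.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE fact_sales (sale_id INTEGER PRIMARY KEY, account_id INTEGER, amount REAL);
CREATE TABLE dim_customer (customer_id INTEGER PRIMARY KEY, name TEXT);
-- Bridge table: one account can belong to several customers (many-to-many).
-- The weight column splits each sale so that totals are not double counted.
CREATE TABLE bridge_account_customer (account_id INTEGER, customer_id INTEGER, weight REAL);

INSERT INTO fact_sales VALUES (1, 10, 100.0), (2, 11, 50.0);
INSERT INTO dim_customer VALUES (1, 'Ann'), (2, 'Bob');
INSERT INTO bridge_account_customer VALUES (10, 1, 0.5), (10, 2, 0.5), (11, 1, 1.0);
""")

# The fact table reaches the dimension only through the bridge table.
rows = conn.execute("""
SELECT c.name, SUM(f.amount * b.weight) AS weighted_amount
FROM fact_sales AS f
JOIN bridge_account_customer AS b ON b.account_id = f.account_id
JOIN dim_customer AS c ON c.customer_id = b.customer_id
GROUP BY c.name
ORDER BY c.name
""").fetchall()

grand_total = sum(amount for _, amount in rows)
print(rows, grand_total)
```

The extra join through the bridge is what a plain star schema does not need, which may be why a cube designer built around direct fact-to-dimension joins requires manual work here.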
A powerful ETL tool which is open source
- Populate relational database
- Transform and clean data
- Create periodic job and generate report
- Aggregate data
- It will be helpful to have modules supporting Google Adwords and Facebook API and Twilio API
- It has "add constant", but does not have "multiply constant" module.
- Unit transform module
Pentaho- An adequate BI solution for users on a budget!
- ETL, fairly wide support for a number of different data sources, a good API for writing plugins, and great out-of-the-box functionality.
- Community support and great documentation for using their tooling.
- Mondrian/OLAP, great engine for processing MDX queries.
- Pentaho's Analyzer tool/front-end. This doesn't come close to competing with products like Tableau.
- Pentaho Report Designer. This looks like something that was built in the early 90's and is extremely clunky to use for new users.
- Schema Workbench, would be nice to see better support for snowflake type schemas
Pentaho powerful backend BI tooling, hit-and-miss data visualization
- Pentaho Data Integration (PDI), which is Pentaho's ETL tool, is a powerful visual scripting tool. It is a very mature ETL tool that can process large quantities of data quickly when provided with appropriate hardware.
- Pentaho Analyzer which is Pentaho's Enterprise browser-based analysis and pivot table tool is powerful and intuitive. When provided with a well defined data warehouse schema it is easy for even non-technical users to quickly generate reports and graphs.
- Pentaho allows you to connect to virtually any datasource provided there exists a JDBC connector, REST API, or some other API end point.
- Pentaho has an open-source Community Edition which provides much of the functionality of the Enterprise Edition without any licensing fees.
- A major problem we have had with Pentaho is their enterprise licensing. As a client who understands very well what is offered within the free Community Edition we felt their mark up on their enterprise features was much too high. They wanted to charge us for migrating from MySQL to Amazon RedShift since RedShift is an "analytical database" while nearly every other database could be connected to for free. Due to licensing concerns we terminated our enterprise license.
- Pentaho's visualization tools are very capable, but have a very steep learning curve and engineering cost. Due to this we ended up switching from Pentaho to Tableau licenses for the majority of our internal dashboarding and reporting. If Pentaho could improve their visualization and dashboarding capabilities, they could truly be an end-to-end BI solution.
An implementor view on Pentaho tools.
- The main strength of Pentaho (PDI) is that it is free. No need for additional license approvals to implement something quickly using the UI tool.
- It has Pentaho Reporting - the ability to use reporting on top of PDI transformations is a powerful feature. PDI can handle data extraction, and with reporting you can create a simple report on this data. They all just work with each other without additional integration required.
- And most of Pentaho projects are open-source. That helps a lot when debugging something.
- Help documentation is an area for improvement. A lot of features and side effects are just not documented.
- PDI uses SWT, and this dependency causes a lot of trouble. PDI runs on Java, but because of SWT you are stuck with platforms that support SWT. The ARM architecture does not have SWT support out of the box.
- Online jobs/transformations creation. Jobs/transformations are just xml. So the lack of online editor is just 'it was not implemented yet'.
- Strange /Pentaho-solutions placement. Application is not deployed on a server in a standard way. Instead - a server should be embedded into a 'Pentaho solution' - very strange architecture.
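Since the review notes that jobs and transformations are just XML, a short sketch shows how such a file could be inspected outside the desktop tool. The sample below is hand-written and only imitates the general shape of a PDI transformation file; the element names (transformation, info, step, order, hop) and the step types are assumptions to verify against a real file, and summarize is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Hand-written sample; element names and step types are assumptions for illustration.
SAMPLE_KTR = """
<transformation>
  <info><name>load_customers</name></info>
  <order>
    <hop><from>Read CSV</from><to>Filter rows</to><enabled>Y</enabled></hop>
    <hop><from>Filter rows</from><to>Write table</to><enabled>Y</enabled></hop>
  </order>
  <step><name>Read CSV</name><type>CsvInput</type></step>
  <step><name>Filter rows</name><type>FilterRows</type></step>
  <step><name>Write table</name><type>TableOutput</type></step>
</transformation>
"""


def summarize(xml_text):
    """Return the transformation name, its steps, and the hops between steps."""
    root = ET.fromstring(xml_text.strip())
    steps = [(s.findtext("name"), s.findtext("type")) for s in root.findall("step")]
    hops = [(h.findtext("from"), h.findtext("to")) for h in root.findall("order/hop")]
    return {"name": root.findtext("info/name"), "steps": steps, "hops": hops}


summary = summarize(SAMPLE_KTR)
print(summary["name"])
for source, target in summary["hops"]:
    print(f"{source} -> {target}")
```

Because the format is plain XML, the same approach could in principle back a web-based viewer or editor, which is the gap the reviewer points out.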
One of the best BI solutions for start up companies or those looking for a highly customizable tool.
- automated reports
- dashboards
- data-integration
- Big Data
- Poor support
- few books published in relation to Pentaho
- Pentaho User Console could be more user friendly
B.I. made simple. Pentaho rocks!
- The Data Integration tool is fantastic. A novice user can get up to speed quickly with it and the GUI is intuitive
- Support. As an enterprise customer, I am always thrilled with the fast turnaround on support tickets that I open.
- Job Scheduling. The Job scheduler is easy to use and very reliable
- Data Analyzer Tool in "User Console". The drag and drop UI makes it easy for my end users to see the data in whatever way suits them. They can even GEO-MAP the data instantly!!
- Speed on ETL. Without a very complex setup, the ETL Job runs single threaded and can be slow for BIG jobs.
- The "Report Designer" is decent, but lacks a lot of control that you might get from more mature products (like Business Objects, for instance). Sometimes you want to rotate or reformat a chart label, and it can't be done. In other cases, it can. It is not consistent. Also, 3D charts don't work in certain circumstances. Finally, charts mapped over time, in some cases, don't have the X axis auto-scale, so not all the data points are readable.
Pentaho - Jumping On The Band Wagon
- A data analyst has quick access to data that would otherwise need to be pulled straight from the database. Additionally, the data dictionary is tied to an object, so upkeep is easy and users can scroll over the object to see exactly what that measure or object represents.
- Report presentation is clean and professional. Pentaho comes with a pretty extensive set of charting and graphing functionality out of the box that in other tools would have to be developed. Easy file output with sufficient types available to choose from. (.pdf, .xlsx, .csv)
- The Data Integration tool is without question the best tool in the set. There is no limit to your transformation options. You can transform virtually any text file, excel file or script into a table, spreadsheet or even report. Many times we are given list of patients and file formats vary from practice to practice. With Pentaho DI we have the flexibility to take what has been provided and standardize.
- Pentaho could improve the integration of LDAP into the User Console. We have been unable to deploy the Pentaho User Console to partners because it does not support a federated LDAP model. Even attempts at integrating with our primary LDAP have proven fruitless.
- The Pentaho Suite as a whole tends to be a resource hog. Since we work with such large data sets we often find that large requests hang up or get an "oops error" which is a standard Pentaho error when the data set is too large. It would be nice to have some indication of our limits so that we are not guessing or we can limit the data set up front.
- Upgrades on the Suite are a bit cumbersome for our Administrators. There is not a standard backup and install package so upgrades often involve our administrator on the phone with Pentaho during the upgrades to walk through a custom install.
Three Years with Pentaho and No Regrets
We use it for report design and delivery. The designer tool is fairly straightforward and once reports are complete they are immediately accessible to our end users.
Pentaho is used for Business Intelligence which includes totals, trends, and forecasts of sales and subscriptions of our services. Pentaho's web-based analysis tool allows our end-users to drag & drop measures and dimensions and instantly convert raw data into a variety of graphical representations. Before we had Pentaho, each view might have required a long query or static report that was hand-exported to Excel. With Pentaho's analysis tool, our sales, finance, and marketing teams have instant insight and infinitely flexible views of our data.
In the IT department, we use Pentaho as a scheduling engine for data maintenance jobs as its interface allows us to manage multiple shards from a single source.
- Pentaho works as advertised -- it provides a truly drag-and-drop customizable interface for your data.
- The Enterprise Suite has everything you need to start working and showing results right away.
- Pentaho's customer support has been responsive and timely when we've had issues.
- The Report Designer interface can seem clunky at times when trying to perform additional calculations on previously calculated fields or pass values to subreports.
- There is inconsistency within the tools in Data Integration -- it is clear the various tools were authored by different engineers at different times. It is not always obvious what goes where.
- There is inconsistency between all the products within the Pentaho Suite. Each has a different look and feel and none work the same way.
- The product is not without flaws, but so far none have been insurmountable.
- Upgrading to the latest version has proven to be EXTREMELY challenging for both us AND Pentaho customer services.
Are your Finance and Marketing teams constantly asking for one-off queries? Pentaho puts this power in THEIR hands so they can pull and arrange the data they want into whatever view they want.
Do you want to pay too much for a full-featured BI solution? Pentaho is very reasonably priced, and even has a free version!
Pentaho the best BI integrator
- Data Integration. Pentaho wins hands down. You can read huge data using a Hadoop process, do your enrichment, load it to a Netezza database afterwards, and finally input the data to your WEKA model to predict which customer will churn or what offer should be made to the customer so that he/she stays put.
- BI Server. If you want to schedule a data read operation of your clickstream data, to finally burst out recommendations of next best actions to end users, Pentaho's BI Server performs this integration seamlessly.
- Mondrian. This layer provides Cube based Hierarchical data modeling on the front end and at the back end converts this OLAP structure to a ROLAP, SQL based model. Hence any relational database becomes a ROLAP engine seamlessly.
- Data Visualization. Provide a richer library of data visualization capabilities. Ultimately the dashboards are the end result of all the hard work done at the back office. Yes, Pentaho has a lower TCO compared to other products, but richer data visualization capabilities would make it a winner! Pentaho has overcome this limitation by allowing external charting engines to be integrated with their product suite, but more needs to be done to strengthen core Pentaho data visualization capabilities. Alternatively, the external charting engine capabilities need to be documented and evangelized.
- Alternative to Mondrian. In our case we needed to analyse data for a million subscribers over key performance areas like Churns, Activations etc. In these scenarios semi additive measures needed to be calculated and presented in a report across days to years grains of time. In such scenarios the Mondrian based ROLAP capability did not scale up to our expectation. Pentaho needs to address such issues and fast.
- In the world of R/SAS/SPSS it's hard to find use cases where WEKA was used in production environments to solve a business problem. We would have needed some hand holding to port our R/SPSS code to WEKA and to help us build newer algorithms on this platform from the ground up.
Pentaho is Java based, so no shortage of skilled Java resources who can help you integrate. We did not spend time building our own reporting engine, Data Visualization layer, Security layer, workflow capabilities. We simply used our proven transactional product and used Pentaho to fill our gaps.
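The ROLAP idea mentioned in this review, answering cube-style questions by generating SQL against a relational database, can be sketched in a few lines. This is not Mondrian's actual SQL generation; build_rollup_sql, the sales table and its columns are all hypothetical, and sqlite3 merely stands in for "any relational database".

```python
import sqlite3


def build_rollup_sql(fact, measure, levels):
    """Build a GROUP BY query for the requested hierarchy levels.

    Identifiers are interpolated directly, so they must come from trusted code.
    """
    cols = ", ".join(levels)
    return (
        f"SELECT {cols}, SUM({measure}) AS total "
        f"FROM {fact} GROUP BY {cols} ORDER BY {cols}"
    )


# Hypothetical fact table with a two-level time hierarchy (year, quarter).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE sales (year INTEGER, quarter TEXT, amount REAL);
INSERT INTO sales VALUES
  (2014, 'Q1', 10.0), (2014, 'Q2', 20.0), (2015, 'Q1', 5.0), (2015, 'Q1', 7.0);
""")

# Top level of the hierarchy, then one level of drill-down.
by_year = conn.execute(build_rollup_sql("sales", "amount", ["year"])).fetchall()
by_quarter = conn.execute(
    build_rollup_sql("sales", "amount", ["year", "quarter"])
).fetchall()
print(by_year)
print(by_quarter)
```

Each drill-down is a fresh aggregate query over the fact table, which gives some intuition for why the reviewer saw scaling problems with a million subscribers and measures that cannot simply be summed across time.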
Pentaho: a strong open source BI suite
- The Pentaho Data Integration tool is extremely versatile. I find it easier to use than comparable tools like SSIS.
- The reporting engine delivers reports in multiple formats, including Excel and PDF.
- It is easy for users to subscribe to reports for delivery by email.
- The report parameterization is very flexible, and I find it easier to use and more versatile than parameterization in Crystal.
- If using the community edition, be prepared to invest some effort in learning the product. Documentation can be vague.
- The new pivot table viewer is part of the enterprise edition. Fortunately there is a nice open source plug in called Saiku that can replace the old, difficult to use jPivot pivot table interface.
- I wish there were easier ways to audit user activity, especially to see which items have not been accessed for a long time and could be retired.
- There are not as many people who know how to use Pentaho in the market as there are people proficient in other BI tools. There is a learning curve and design patterns are not necessarily the same from one BI tool to the next.
Pentaho Report Designer Integrated with Odoo v7!
- Connect via JDBC
- Integrates with Odoo (formerly OpenERP)
- Grouping, summing seem to work well even though they are not intuitive as far as set up goes.
- The multiple report output types are great (PDF, CSV, Excel, Word, Text, etc)
- Connect via the web client of Odoo
- Easy to build in parameters for filtering in the report
- Report designer seems to be buggy on Windows 7
- The interface for the report designer is outdated and difficult to navigate or use.
- I would like to see a tool tip when hovering over options
Pentaho Open Source
- For an open source tool the documentation was fairly good and the support reasonable
- The tool was relatively easy to learn without a lot of training.
- We did have some issues getting the automation to work. I believe this was 'Kettle'. Other than that, not too many bugs.